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ABSTRACT 

Toeplltz  matrices  are  shown  to  be  a  special  class  of  a  more  general  class 
of  matrices,  called  herein  "masked  matrices",  whose  elements  satisfy  a  two 
dimensional  linear  recursion.  An  explicit  matrix  inverse  for  doubly  infinite 
matrices  satisfying  a  2X2  mask  Is  derived.  The  well  known  explicit  inverse  of 
doubly  infinite  Toeplltz  matrices  is  a  special  case.  It  is  shown  that  the 
Inverse  of  a  masked  matrix  is  a  masked  matrix. 

Algebraic  properties  of  masked  matrices  are  also  examined.  In 
particular,  we  Introduce  an  operation  by  polynomials  on  masked  matrices  and  we 
show  that  this  operation  Induces  a  decomposition  of  masked  matrices  into  other 
masked  matrices  which  are.  In  a  sense,  simpler. 
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I.  INTRODUCTION 


A  doubly  infinite  Toeplitz  matrix  is  an  array  of  complex  numbers 
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t 

col  0 
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(1) 


having  the  property  that  every  entry  on  any  given  diagonal  is  identical.  To 
construct  a  general  Toeplitz  matrix,  start  with  the  doubly  infinite  sequence 


• . .  .  c_2#  c_i ,  Cg.  C-|  ,  C2» 


(2) 


and  then  form  the  associated  matrix  Tc  =  [tn  m]  with  tn  m  =  cn_m.  The 
central  question  is:  Can  the  system  of  equations 


Tc  x  =  b  (3) 

be  solved  uniquely  for  the  sequence  x  =  (xn),  given  the  sequence  b  = 

( bn ) ?  This  question  Is  equivalent  to  asking  when  the  matrix  Tc  is 
invertible. 

Define  the  function  f(e)  to  be  the  Fourier  series 

QO 

f(e)  =^cneine  .  (4) 

n=-<» 

Toeplitz  [1]  proved  that  the  matrix  Tc  is  invertible  if  and  only  if  the 
function  l/f(e)  Is  essentially  bounded.  By  definition,  a  function  g(e)  is  NOT 
essentially  bounded  if  and  only  If  each  of  the  sets 

e  :  |g(e)|  >  kj  k=l ,  2,  ... 

has  positive  measure.  Toeplitz  also  gave  the  following  explicit  formula  for 
Tc1 •  The  Fourier  series  expansion 
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(5) 


gives  Tc1  =  [f n_m3 ,  that  Is, 
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This  result  Is  known  as  the  Toeplltz  Inversion  Theorem  [1,2]  and  will  be  shown 
to  be  a  special  case  of  Theorem  1.4  below. 


The  purpose  of  this  paper  Is  to  explore  extensions  of  Toeplitz's  basic 
and  Important  results  to  a  more  general  class  of  matrices  that  we  call 
"masked"  matrices.  In  words,  a  doubly  infinite  matrix  T  =  [tn>m]  is  said  to 
be  a  masked  matrix  If  there  exists  a  complex  (p+l)x(q+l)  matrix  M,  with  p>l 
and  q>l,  which  Is  orthogonal  to  every  rectangular  submatrix  formed  from  T  by 
eliminating  all  but  p  consecutive  rows  and  q  consecutive  columns.  Let  M  = 

[an  k].  J=0 . p  and  k=0,  ...,  q.  Then  M  is  a  mask  for  T  =  [tnm]  If 

ana only  If 


P  Q 


j,k  n+j,m+k  =  0  for  n,  m  =  0,  +1 ,  +2, 


j=0  k=0 

where  “  denote  conjugation. 

Suppose  the  matrix  T  has  the  2x2  mask 


(7) 
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Then,  from  (7), 


1  1 

£  ajk  tn+j ,  m+k  =  Vm  "  tn+l  ,m+l  =  0  141  * 

k=0  j-0 

Consequently,  any  matrix  T  having  the  mask  (8)  must  be  a  Toeplitz  matrix. 
Conversely,  If  T  Is  Toeplitz,  It  has  the  mask  (8). 

Note  that  a  Toeplitz  matrix  also  has  the  3  x  3  mask 

1  0  o' 

0  0  0 

0  0-1. 


However,  not  every  matrix  satisfying  this  3x3  mask  Is  necessarily  Toeplitz. 
This  points  out  the  fact  that  a  given  mask  M  can  be  used  to  characterize  a 
class  of  matrices.  As  is  easy  to  see,  the  class  of  all  matrices  satisfying  a 
given  mask  M  forms  a  linear  subspace  of  the  space  of  all  matrices.  The 
dimension  of  this  linear  subspace  is  countably  infinite  for  any  mask  M. 

Consider  the  general  2x2  mask 


where  a,  b,  c,  and  d  are  known  constants.  If  the  matrix  T  =  [tnm]  satisfies 
this  mask,  then  from  (7)  •  J 


+  b  t 


n,m+l 


+  c  Vi-l  ,m  +  d  tn+l  ,m+l 


(10) 


If  the  mask  (9)  is  the  Toeplitz  mask  (8),  then  knowledge  of  one  entire  column 
of  T  suffices  to  predict  all  of  T.  Is  this  still  true  for  general  2x2 
masks?  The  answer  is  yes,  subject  to  certain  conditions,  as  the  following 
Theorem  shows. 

Theorem  1.1 .  Given  any  column  of  a  doubly  infinite  matrix  T,  and  given  that  T 
satisfies  the  general  2x2  mask  (9),  then  T  is  uniquely  determined  if  and 
only  if  both  the  following  conditions  hold: 


(1)  |a|  *  IcI  and  either  a  *  0  or  c  *  o 

(2)  jbf  jt  Id  I  and  either  b  *  0  or  d  *  0. 
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Proof ;  Let  the  given  column  of  T  be  the  m-th  column.  From  (10),  the 
equations  for  column  m+1  are 
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Both  matrices  are  clearly  Toeplitz.  By  the  Toeplitz  Inversion  Theorem  stated 
above,  the  matrix  on  the  left  has  an  inverse  if  and  only  if 


f(0)  =  b  +  d  e^e  ^  0  "for  any  ©  • 

Since  f(e)  *  0  if  and  only  if  |bj  *  ld|  and  either  b  *  0  or  d  *  0,  this  system 
can  be  solved  uniquely  for  column  m+1  of  T.  Similarly,  column  m+1  of  T 

determines  column  m  if  and  only  if  a+ce  *0.  By  an  obvious  induction  it 

follows  that  all  of  T  is  uniquely  determined.  This  completes  the  proof. 


Define  the  doubly  infinite  Toeplitz  matrices  Mj  by 
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col  0 

By  Toeplitz's  Inversion  Theorem,  the  matrices  Mj  are  invertible  if  and  only 
if  the  functions  *  . 

P 

fj(9)  =  ^an,j  ein9  •  J  -  °.  1 . Q.  (12) 

n=0 

have  no  zeros  for  real  values  of  ©;  •  i.e.}  ^(0)  to  -for  Any 

Theorem  1.2:  Let  M  =  [a^]  be  a  (p+1)  x  (q+1)  mask  for  the  matrix  T.  Let 
Cn  denote  the  n-th  column'  of  T,  for  n=0,  +1,  +2,  ...  .  Then 

q 

y  Mj  Cn+j  =  0  ,  n  =  0,  +1,  ±2,  ...  (13) 

j=0 

where  Mj  are  the  Toeplitz  matrices  (11). 


Proof :  Fix  n.  The  definition  (7)  now  gives  a  system  of  equations,  one  each 
for  m  =  0,  +1,  +2,  ...  .  This  system  is  (13).  This  concludes  the  proof. 
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Corollary  1.1.  Any  q  columns  of  the  matrix  T  determines  all  of  T  If  none  of 
the  functions  fj(e),  defined  by  (12),  have  zeros  for  real  e. 

Proof :  The  matrices  Mj  are  Invertible  under  the  conditions  stated.  Case 
1:  suppose  that  columns  0,  ....  q-1  are  known.  Then 


Cn+q 


which  implies  that  Cn+g  may  be  found  from  Cn,  Cn+j,  ....  Cn+q_-| .  This 
fills  out  the  right  half  of  T,  and  similar  reasoning  fills  out  the  left  half. 

Case  2:  Suppose  that  columns  0 . t-1 ,  t+1 . q-1  are  known.  Then  (13) 

can  be  used  to  find  column  t.  This  reduces  the  problem  to  the  first  case,  and 
so  concludes  the  proof. 

We  remark  that  Corollary  1.1  Is  a  generalization  of  Theorem  1.1. 

An  important  observation  is  that  the  vector 


q-1 


(M 


-1 


j=0 


V 


Cn+j  •  311  n* 


(14) 


Ee  *  (...,  e"i2®,  e-ie.  1,  e^®,  e12e,  ...)t  (15) 

is  an  eigenvector  of  every  matrix  Mj,  defined  by  (11),  and  that  the 
eigenvalue  corresponding  to  Ee  is  fj(e),  defined  by  (12).  Thus 


Mj  Eq  =  fj(e)  Ee. 

Because  Mj  Is  Toeplitz,  we  also  have 

4  Hj  ■  V9>  Ee  ' 

A 

The  proof  of  this  fact  Is  a  straight  forward  verification. 

o' 

Now,  multiplying  (13)  on  the  left  by  E*  gives 


(16) 


(17) 


q 

j-o 


0  =  (EJ1  M.  C  . 
e'  7  J  n+j 
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)  C 


n+j 


-2>e"j 

j=0 

q 

'  E  Fj(e>  4  Cn+J 
J“0 

q 

-£fj<e>(Ee  C„+J) 

J-o 


■Efj 


(8)  Fntj(8) 


j=0 


where  we  have  defined  the  functions 


Fn<*>  "  C 


OO 


k=-^o 


We  stress  that  the  existence  of  Fn(e)  defined  by  the  Fourier  series  (18)  is 
an  assumption.  Although  the  Corollary  1.1  proves  that  each  column  of  T  exists 
uniquely,  we  have  not  shown  that  each  column  gives  a  convergent  Fourier  series 
(18).  We  have  proved  the  following  result. 

Theorem  1.3:  If  M  is  a  (p+1)  x  (q+1)  mask  for  the  matrix  T,  and  if  every 
function  Fn(e)  defined  by  (18)  is  convergent,  then 


q 

y~V(e)  Fn+j(e)  =  0,  n=0,  ±1,  ±2 .  (19) 

j=0 
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where  fj(0)  Is  defined  by  (12). 
Corol larv  1.2:  If  p  =  q  =  1, 

■  -[;  s] 

and 

fo(e)  =  a  +  c  e^e  *■  0 
f ! (©)  =  b  +  d  eie  *  0 

then 


Fn(e)  =  (-l)n  0  FQ(0),  n=0*  -1  *  ±2  *  ••• 

Proof:  From  (19),  for  n  >  0, 

fQ(e)  Fn(e)  +  f-|  (e)  Fn+1(0)  =  0 
so  that 


Fn+1 

which  implies 


half  of  (20).  Similarly, 


F-n-l(e)  +  fl(e)  ^n(e)  =  0 


or 


(20) 


This  completes  the  proof. 


•Corollary  1.3:  Suppose  there  does  not  exist  any  value  of  0  such  that 

Fn(e)=0  For  n-  Define  the  trigonometric  polynominal 
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ffakJ  .1(^’  .  (21) 

j=0  k=0 

Then,  for  every  ©  there  exists  0  such  that  D  (0,0)  =  0. 


Proof:  Fix  ©.  From  (19)  we  get  the  Toeplitz  system 


.  f0(e)  f-|  (e)  f2(e)  ...  fq(e)  0  0 

F_i(e) 

row  0  **> 

.  0  f  0(e)  f](e)  ...  fq_i (©)  fq(0)  0 

.  0  0  f0(e)  ...  fq_2(e)  fq_i (e)  fq(e)  . 

a  |  •  •  .  •  •  •  • 

A 

Fo  (®) 
Fn  (©) 

L  t 

col  0 

“  m, 

By  Toeplitz  Inversion  Theorem  this  system  has  a  unique  solution  if  and  only  if 
D(e,0),  as  a  function  of  0,  does  not  vanish.  But  if  D(e,0)  *  0  for  any  0, 
then  Fn(e)  =  0  for  all  n,  since  the  right  hand  side  of  this  Toeplitz  system 
is  identically  zero,  contradicting  the  hypothesis.  Thus,  there  exists  0  such 
that  D(e,0)  =  0.  This  completes  the  proof. 

We  can  solve  formally  the  general  system  T  x  =  b  for  x  when  the  matrix  T 
has  a  2  x  2  mask,  that  is,  p  =  q  =  1.  The  next  theorem  gives  an  explicit 
matrix  inverse  for  this  case.  The  problem  for  other  values  of  p  and  q  is  more 
difficult. 

Theorem  1.4:  Let  p  =  q  =  1,  and  let  the  mask  M  be  as  in  Corollary  1.2  for  the 
matrix  T  =  [tnm].  Suppose  that  T-1  =  [sn  m]  exists  and  that,  for  every 
m,  the  function 


n=-oo 


is  defined  on  the  unit  circle  r  and  on  the  set 
L_1 (r)  =  jze  C  :  z  =  L_1(w),  wcr 

where 


a  +  b  z 


,  ad 


be  *  0. 


L(z)  = 


c  +  d  z 


(21) 
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Suppose  that 


<30 


(1)  FQ(z)  =  'y  ^  tn  0  zn  exists  and  is  defined  on  rand  L(r) 

n=-» 

(2)  l/Fo(L(e^®))  is  essentially  bounded  for  all  real  e 


(3)  p-^y  =y  ^  f zK  exists  and  is  defined  on  the  set  L(r) 
Then,  the  general  entry  sn<m  of  T“1  is  given  explicitly  by 


VZ  \  Un H-k.n) 


k=-oo 


where,  for  all  integers  y  and  v,  we  define 


(22) 


T/  \  JL  f  /a  Z  +  bV  d  z 
I(y,v)  -  2iri  J  \c  z  +  d)  z  v  +  1 


(23) 


Proof:  Consider  the  general  system  T  x  =  b  or,  equivalently, 


4*0 


£  *- 

k=-«o 


k  xk  '  bn  •  n-°'  ±’-  •"  • 


(24) 


Let  z  be  a  complex  variable.  Multiply  both  sides  of  (24)  by  zn  and  sum  over 
n  to  get 


00 

00  ^  .  n  V"  >  u  n 

v— v  v— v  t  .  x,  z  =X  b  z  s 

I  E  n,k  1  2-  n 


B(z)  . 


(25) 


n=-«o  k=-«° 


ns— oo 


All  series  are  assumed  to  be  absolutely  summable,  so  interchange  the  order  of 
summation  to  get 
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B(z) 


Fk(z) 


where  (see  (18)) 


Fk(z) 


By  Corollary  1.2,  we  have 


BU>  -  F0(2)£xk  (-§^H)k  _ 
k-~  oo 

Because  ad  -  be  *  0,  L-1(x)  exists  and  equals 


L_1(z) 


a  4-  c  z 
b  +  d  z' 


Hence,  from  (27),  we  write 


. 

B(z)  =  F0(Z)  ^xk  [l_1  (Z)]  k 
k=-*>o 


where 


(26) 


(27) 


(28) 


(29) 


oo 

X(z)  zk  * 

Ka-<» 


.  (30) 
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We  will  use  X(z)  only  for'sequences  (x|<)  which  are  the  columns  of  T-^ ;  by 
hypothesis,  then,  we  have  that  X(z)  is  defined  on  the  sets  L-1 (r)  and  P. 
Since  B(z)  and  Fg(z)  are  assumed  to  be  defined  on  the  set  L(r),  we  have 


X(z)  =  . 

Fq( L( z) ) 

Using  (30)  and  (31)  gives  the  n-th  coefficient  of  X(z)  as 


x 


n 


2w  J  Fo(L(e"10)) 


-Ine 

e 


de. 


By  assumption 


•  1 

F(L(e1e)) 


(L(ei0)) 


k 


is  essentially  bounded  for  all  e,  so 


xn  2tr^fk 


To  find  the  general  entry  snm  of  T-1 ,  we  take  B(z)  =  zm,  corresponding 
to  choosing  the  vector  b  to  4iave  all  zero  components,  except  the  m-th 
component  which  is  1.  With  this  choice  of  b,  we  have  xn  =  s^  and 


1  r  (  a  +  b  e~ 

sn,m  -  2.^fk  /('  c  +  d  e- 

=  -eo  — Tf 


1er  -Ine 
iej  e  de 


/ 


B(L(e  i0)) 


,  -ie.  Nk  -ine 
(L(e  ))  e  de 
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=  2>1)m+k  fk  I(m  +  k,  n) 
f/us  i~k<s.  ptoo'f'. 


As  an  Interesting  corollary  to  this  result,  we  are  able  to  obtain  a  mask 
for  the  Inverse  of  a  masked  matrix. 


Corollary  1.5:  Let  T  and  T_1  be  as  In  theorem  1.4. .  If  M  = 

,-l 


=  a  bl 
c  dj 


is  a  mask  for  T,  then  M  = 


a  c  i 
-b  d. 


s  a  mask  for  T 


Proof;  Before  we  begin  this  proof.  It  is  worthwhile  to  note  that  conjugates 
appear  in  “Pf  but  disappear  in  the  proof  since  the  definition  of  a  mask 
requires  us  to  take  conjugates  before  applying  the  mask.  According  to  theorem 
1.4,  we  may  represent  snifn  as 


n,m 


^  (-l)m+k  fk  I(m+k ,  n) 


k=-°o 


Appl yi»v^  irUe,  ma.sk  M 

as  +cs  ,+bs,  +  d  s  ,  . 

n,m  n,m+l  n+l,m  n+l,m+l 


■£f- 


l)m+k  fk  a  I(m+k,  n)  +  (-])m+1+k  f  c  I(m+l+k,  n) 


k=-« 


+  {-l)m+k  ffc  b  I ( m+k ,  n+1 )  +  (-i)m+1+k  ffc  d  I(m+l+k,  n+1  )> 


^  ^  (_l)m+k  |a  i(m+k,  n)  -  c  I(m+l+k,  n)  +  b  I(m+k,  n+1) 


k=-o 


d  I(m+k+l 


•  "+1)| , 
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The  sum  within  the  braces  reduces  to  an  integral  over  r  which  has  the,  integrand 


Now  padding  fractions,  we  find  the  numerator  of  the  bracket  is 
a  z(cz  +  d)  -  c(az  +  b)z  +  b(cz  +  d)  -  d(az  +  b) 

=  (ac  -  ca)z2  +  (ad  -  be  +  be  -  da)z  +  bd  -  db 
=  0  . 

Thus,  the  mask  of  the  inverse  is  the  Hermitian  transpose  of  the  original  mask. 


Toeplitz's  Inversion  Theorem  is  also  a  corollary  of  this  result.  With 
the  mask  (8) ,  we  have 

I(v.  *)  -  5 il/  <-l)y  z'l_V"1  dz 
r 


where  6  is  Kronecker's  delta.  Hence,  from  (22), 

=  fn-m 


exactly  as  given  by  (6) . 
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II.  Masked  Matrices  and  Polynomials 

In  this  section  we  will  explore  a  connection  which  exists  between  masked 
matrices  and  polynomials  In  two  variables  over  the  complex  numbers.  When  we 
have  developed  this  connection  we  will  be  able  to  exploit  the  well  known 
arithmetic  of  polynomials  to  further  our  understanding  of  masked  matrices.  In 
order  to  do  this,  however,  we  need  to  establish  some  notation  and  to  fix  some 
Ideas.  Throughout  this  section,  C  will  denote  the  complex  numbers  and  Z  will 
denote  the  integers.  First,  we  state  formally  our  definition  of  a  masked 
matrix. 

Definition  II. 1 :  Let  T  =  [tn.m]  n  meZ  be  a  doubly  Infinite  matrix 
over  the  complex  numbers.  Let  M  =  tajcjl*  *  =  0,  ... ,  p,  j  =  0,  ...»  q  be  a 
matrix  of  complex  numbers.  We  say  M  Is  a  matrix  mask  for  T  If  and  only  if 


Q  P 

£  X)  5k-j  Vk.s.j  ■  °  for  ^  -  °>  ±’-  ±2'  ••• 

j*0  k=0 

where  “  denotes  complex  conjugation. 

Intuitively,  In  order  to  decide  whether  or  not  M  is  a  mask  for  T,  we 

place  M  on  T  with  the  element  a_  A  on  top  of  t  ,  aA,  on  top  of  t  •  , 

0,0  rs  01  r+1  ,s 

etcetera.  Then  we  multiply  ak  j  by  tr+k  $+j  and  add  the  products.  If  that 

sum  is  zero  for  every  choice  of  r  and  s,  then  M  is  a  mask  for  T. 

Let  us  now  consider  a  more  general  action  of  M  on  any  complex  doubly 
Infinite  matrix.  From  now  on,  DM  denotes  the  complex  vector  space  of 

all  doubly  infinite  matrices.  If  M  is  a  p  +  1  by  q  +•  1  matrix  over  C,  we 
define  a  linear  operator  DM  -*  DM  as  follows.  If  T  c  DM,  then  the 

(r,s)  entry  of  <?H(T)  is  given  by 


q  P 

yi  yi  ak,j  v+k.s+j 

k=0  j=0 


for  all  r,s  =  0,  +1 ,  +2,  . . . 


Intuitively,  In  order  to  calculate  the  (r,s)  entry  of  <^M(T),  we  place 

M  on  T  with  the  element  a_  A  on  top  of  t  ,  aA  ,  on  top  of  t 

0,0  K  r,s  0,1  K  ^  1 

etcetera.  Then  we  multiply  ak  ^  by  tr+k  $+j  and  add  the  products.  We 

state  some  of  the  properties  of  ^  in  the  following  theorem  whose  proof  is 
obvious . 
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Theorem  II.l:  Let  M  be  a  p  +  1  by  q  +  1  matrix  over  C  and  define 
as  above.  Then  ^  is  a  linear  transformation  on  DM.  If  T  e  DM,  then  M  is 
a  matrix  mask  for  T  if  and  only  If  T  Is  in  the  kernel  of  3^. 


As  an  example,  consider  the  Toeplltz  mask 


M  = 


-1 

0 


0 

1 


If  T  is  any  element  of  DM,  then  the  (r,s)  entry  of  ^m(T)  Is  tr+i  s+-|  - 
tr.s* 


Some  facts  about  masked  matrices  follow  Immediately  from  the  theorem. 

For  example,  since  the  kernel  of  is  a  subspace,  if  M  is  a  matrix  mask 
for  both  T-|  and  T2  then  M  must  be  a  matrix  mask  for  T-|  +  T2. 

Furthermore,  ^  Is  zero  If  and  only  if  M  is  an  entirely  zero  matrix. 

Some  obvious  properties  of  are  worth  keeping  in  mind.  For 
example,  -j]  merely  shifts  every  doubly  infinite  matrix  one  column  to  the 

upward.  The 


5?h  if  and  only 


If  M-j  and  M2  are  obtained  from  some  matrix  M3  by  the  concatenation  of 
rows  of  zeroes  at  the  bottom  and  columns  of  zeroes  on  the  right. 

Now  we  shall  adopt  a  new  point  of  view  which  will  allow  us  to  bring  In 
some  more  mathematical  machinery.  Suppose  we  have  a  p  +  1  by  q  +  1  matrix  M  ■ 

[a^j],  k  =  0,  ....  p,  j  =  0,  ....  q.  Then  it  is  possible  to  interpret  the 

k-th  row  of  M  as  a  polynomial  f|<(x)  where  the  coefficient  of  xJ  is  a^j. 

We  may  then  Interpret  the  entire  matrix  as  a  polynomial  in  two  variables  x  and 
y  where  the  coefficient  of  yk  is  f|<(x). 

For  example,  if  M  Is  the  Toeplltz  mask,  fg(x)  =  -1  •  x°  +  0  •  x  =  -1 

and  f -j ( x )  =  0  •  x°  +  1  •  x  =  x,  so  the  entire  polynomial  is  xy  -  1 . 

Similarly,  if  we  consider  the  Hankel  mask. 


left  while  ^i-q-i  shifts  every  doubly  infinite  matrix  one  row 

LlJ 

linear  transformation  is  the  Identity.  Finally,  ^  = 


M  = 


0 

1 


we  get  the  polynomial  y  -  x.  The  row  matrix  [0 

m 


column  matrix 

ll 


corresponds  to  y. 


1]  corresponds  to  x  while  the 


Conversely,  if  we  are  given  a  polynomial  in  x  and  y,  we  get  a  matrix  as 
follows.  If  the  degree  of  the  polynomial  in  x  is  q  and  the  degree  of  the 
polynomial  in  y  is  p,  we  will  have  a  p  +  1  by  q  +  1  matrix.  The  entry  in  row 
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r  and  column  s  in  the  coefficient  of  xsyr.  For  example,  the  polynomial 
1  +  x  +  xy  gives  rise  to  the  matrix 


1  1 
0  1 


We  should  note  that  the  correspondence  between  matrices  and  polynomials 
is  not  quite  one  to  one.  That  Is,  if  we  start  with  a  matrix,  derive  a 
polynomial  and  then  derive  a  matrix,  we  might  not  end  with  the  original 
matrix.  That  is  because  the  original  matrix  might  have  had  a  column  of  zeroes 
at  Its  right  or  a  row  of  zeroes  at  Its  bottom.  But  either  of  these  cases 
would  be  cut  off  in  the  transition  to  a  polynomial  and  then  back  to  a  matrix. 

There  is  an  obvious  relationship  between  addition  of  matrices  and 
addition  of  polynomials  under  our  correspondence  provided  we  are  willing  to 
juggle  sizes  of  matrices  and  degrees  of  polynomials.  However,  there  is  no 
obvious  correspondence  between  polynomial  multiplication  and  the  usual  matrix 
multiplication.  We  would  be  very  surprised  if  there  were  since  polynomial 
multiplication  is  commutative  and  matrix  multiplication  is  not.  However,  we 
can  define  a  convolution  of  matrices  by  using  the  multiplication  of 
polynomials. 

Definition  II. 2:  Let  M1  be  a  p  +  1  by  q  +  1  matrix  and  let  M2  be  an 
r  +  1  by  s  +  1  matrix.  Then  M-|*M2  is  a  p  ♦  r  +  1  byq+s  +  1  matrix. 

The  (k,j)  entry  of  M-|*M2  Is  the  sum  of  all  products  m^  m^ 

where  a  +  c  =  k,  b  +  d  =  j  and  the  superscripts  1  and  2  refer  to  entries  in 

M-|  and  M2  respectively.  M,  *  is  Hie  ccnvciuh°*  M%, 

The  polynomial  corresponding  to  M^*M2  is  equal  to  the  product  of  the 
polynomials  of  M1  and  M2.  Thus,  we  see  that  convolving  M-j  and  M2 
corresponds  to  turning  M-j  and  M2  Into  polynomials,  multiplying  those 
polynomials  as  usual  and  then  returning  a  matrix  according  to  the  conventions 
we  established  before. 

For  example,  if 


-1 

0 

"0 

-l" 

Mi  - 

.0 

1. 

and 

m2  = 

_1 

q 

then  we  multiply  xy  - 

1  by  y 

-  > 

to  get 

xy2  +  (-x 

2  _ 

i)  y  + 

*0 

1 

0" 

m.,*m2  = 

-1 

0 

-1 

* 

0 

1 

0 
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Using  the  fact  that  convolution  corresponds  to  polynomial  multiplication 
we  see  at  once  that  it  Is  commutatl ve,  associative  and  distributes  over  matrix 
addition.  Furthermore,  the  1  x  1  matrix  [1]  acts  as  a  unit  for  *. 

It  is  Interesting  to  note  the  effect  of  convolving  M  with  [0  1].  Since 
the  latter  corresponds  to  the  polynomial  x,  M*[0  1]  is  the  same  as  M  except 


that  a  column  of  zeroes  has  been  added  on  the  left. 


Simi larly,  M* 


is  the 


same  as  M  except  that  a  row  of  zeroes  has  been  added  on  top. 


Now  we  can  use  the  correspondence  between  matrices  and  polynomials  to 
introduce  polynomial  operators  onto  the  space  of  doubly  Infinite  matrices. 
From  now  on,  we  use  the  notation  C[x]  to  denote  polynomials  in  one  variable 
over  the  complex  numbers  and  C[x,y]  to  denote  polynomials  in  two  variables. 


q  p 

Definition  II. 3:  Let  0(x,y)  =  ^  ^  a^  k  xky^  e  C[x,y].  Let  T 

k=0  j=0 ' 

be  a  doubly  infinite  matrix  over  C.  Define  0(x,y)T  by  the  formula 


q  P 

(»<x,y)T)5s  =  ^  ^2  5],k  Vj  stk 

k=0  j=0 

Definition  II. 4:  Let  0(x,y)  e  C[x,y]  and  let  T  e  DM.  We  say  0  is  a 
polynomial  mask  for  T  if  and  only  if  0(x,y)T  =  0  where  0  is  the  zero  matrix. 

The  theorem  below  tells  us  that  the  operation  of  polynomials  on  doubly 
infinite  matrices  actually  makes  sense. 

Theorem  I I. 2:  Using  the  above  multiplication,  DM  is  a  module  over  C[x,y]. 

Proof:  The  only  module  property  which  is  not  at  once  obvious  is  the 


q  P 

associative  law.  So  let  0(x,y)  =  \  '  \  ^  a^  k  xkyJ  and  Wx.y)  = 

k=0  j  =o 

s  r 

EE  bp  m  xm  yn  be  the  elements  of  C[x,y]  and  let  T  c  DM.  Then 
m=0  n=0 
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(0(x.y) (v|i  (x,y)T) 


EE 

v=n  i=n 


(x.y)T) 


u+j , v+k 


'EE 

k=0  i=0 


bn,m  tu+j+n,v+k+ffl] 


n=0 


q  p  s  r 

■EEEE 

k=0  j=0  m=0  n=0 


b 


n,m 


t 


u+j+n,v+k+m 


We  can  change  indices  on  the  last  sum  and  rewrite  it  as: 


q+s  p+r  / 

ELSE 

•  y=0  V=0  \(j,n)  (k.m) 

where  the  sum  In  the  middle  Is  taken  over  all  pairs  (j,n)  and  (k,m)  where 
j+k  =  p,  n  +  m  =  u.  But  the  sum  In  the  middle  is  the  coefficient  of 
xvyv  in  0(x,y)  v|i(x,y)  and  that  is  all  we  need  to  complete  the  proof. 


We  can  use  this  theorem  to  prove  some  elementary  facts  about  masks. 


Proposition  II. 3:  Let  0]{x,y),  02(x,y)  c  C[x,y]  and  T-j ,  T2  e 
DM.  Then,  If  0-](x,y)  and  02(x,y)  are  botb  polynomial  masks  for  Tj,  so 
is  0-|(x,y)  +  02(x,y).  If  0-|(x,y)  Is  a  polynomial  mask  for  T-|  and 
02(x,y)  Is  a  polynomial  mask  for  T2,  then  0-|(x,y)  02(x,y)  is  a 
polynomial  mask  for  T-j  +  T2.  Finally,  if  0-|(x,y)  Is  a  polynomial  mask 
for  T-] ,  then  0](x,y)  02(x,y)  is  also  a  polynomial  mask  for  T-| . 

Proof:  These  are  just  straight-forward  calculations.  For  example,  if 
«>l(x,y)  is  a  polynomial  mask  for  T-|  and  02(x,y)  is  a  polynomial  mask  for 
T2  then 


0-,(x,y)  02(x,y)  (T]  +  T2)  =  0-| (x,y)  02(x,y)T1  +  0-,(x,y)  02(x,y)T2 


=  02(x,y)  01(x,y)T1  +  0 


=  0  +  0 


=  0  . 
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The  other  facts  follow  just  as  easily. 

Let  us  examine  some  examples  using  the  above  theorems.  A  Toeplitz  matrix 
satisfies  xy  -  1 .  By  the  above  theorem,  it  also  satisfies  0(x,y)(xy  -  1)  for 
any  0(x,y).  If  0(x,y)  =  xy  -  1  for  example,  we  find  that  a  Toeplitz  matrix 
also  satisfies  x2y2  -  2xy  +  1.  That  is,  the  matrix 


1  0  o' 

0-2  0 
0  0  1 

is  also  a  mask  for  a  Toeplitz  matrix.  However,  not  every  matrix  which 
satisfies  this  mask  is  necessarily  Toeplitz. 

As  another  example,  consider  the  Toeplitz  mask  xy  -  1  and  the  Hankel  mask 
y  -  x.  Then,  according  to  the  above  theorem,  a  polynomial  mask  for  a  Toeplitz 
matrix  plus  a  Hankel  matrix  is  given  by  (xy  -  l)(y  -  x)  =  xy2  -  x2y  -  y  +- 
x.  That  is, 


0  1  0 

-1  0  -1 

0  1  0 

- 

is  a  Toeplitz-plus-Hankel  mask. 

Finally,  suppose  we  have  a  matrix  which  is  both  Toeplitz  and  Hankel. 
Then  it  must  satisfy  both  xy  -  1  and  y  -  x.  But,  using  the  theorem,  it  must 
also  satisfy  xy  -  1  -  x(y  -  x)  =  x2  -  1 .  So  it  must  satisfy  the  matrix  mask 
[-1  01].  Therefore,  any  row  must  look  like 


a  b  a  b  a  b  . 


By  the  Toeplitz  property,  any  other  row  must  be  a  shift  of  this  row.  Thus  any 
matrix  which  is  both  Toeplitz  and  Hankel  looks  like 


a  b  a  b  a  b 
b  a  b  a  b  a 
a  b  a  b  a  b 


We  formalize  the  above  remarks  in  a  theorem  whose  proof  is  so  routine  we  omit 
it. 
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Theorem  11.4:  Let  T  e  DM.  The  set  of  all  polynomial  masks  for  T  is  an 
Ideal  In  C[x,y].  Conversely,  if  0(x,y)  e  C[x,y],  the  set  of  all  T  e  DM  for 
which  0(x,y)  is  a  mask  is  a  submodule  of  DM. 


III.  Decomposition  of  Masked  Matrices 

Part  of  Theorem  II. 3  may  be  restated  by  saying  that  a  polynomial  mask  for 
a  sum  is  the  product  of  the  polynomial  masks  for  the  separate  summands. 
Therefore,  it  Is  natural  to  ask  the  following  question.  Suppose 
fj-|(x,y)  Is  a  polynomial  mask  for  T.  Are  there  matrices  T]  and 

T2  such  that  0-|(x,y)  is  a  mask  for  T-|,  02(x,y)  a  mask,  for  T2  and 
T  =  T]  +  T2  ?  In  general,  the  answer  to  this  question  is  no.  However,  if 
we  Insist  that  0-|(x,y)  and  ^(x.y)  be  relatively  prime,  then  the  question 
does  Indeed  have  an  affirmative  answer.  In  particular,  the  main  theorems  of 
this  section  are  theorems  III. 13,  III. 14  and  III. 17  which  formally  state  the 
manner  in  which  the  decompositions  take  place. 

In  order  to  prove  the  decomposition  we  shall  need  some  auxiliary  results 
concerning  masks  for  row  vectors.  Therefore,  we  shall  let  DV  stand  for  the 
set  of  doubly  infinite  row  vectors  over  the  complex  numbers.  If  v  e  DV,  we 

n 

shall  picture  It  as  (...,  v^ ,  vQ,  v-j ,  ...).  If  f(x)  =  ^  ^  akxk  e  C[x],  we 

k=0 

n 

define  f(x)v  by  letting  the  j-th  entry  be  ^  ^  ^kvj+k*  Then  a11  of  the 

k=0 


properties  that  held  true  for  doubly  infinite  matrices  over  C[x,y]  are  easily 
proved  for  doubly  infinite  vectors  over  C[x].  In  particular,  we  say  f(x)  is  a 
polynomial  mask  for  v  if  f(x)v  =  0. 


If  a  e  C  Is  non-zero,  we  let  a  be  the  doubly  infinite  vector  whose  k-th 
entry  is  a^.  We  let  0  be  the  doubly  infinite  vector  whose  entries  are  all 
zero.  For  non-zero  a  and  j  >  0,  let  a(j)  be  the  doubly  Infinite  vector 
whose  k-th  entry  is 

k(k  -  1)  ...  (k  -  j  +  l)(I)k_j 

Thus  a^  =  a,  and  a^  =  (...,  -2(a)  3,  -(a)  0,  1,  2a,  ...).  In  general, 

the  k-th  entry  of  a^  is  the  j-th  derivative  of  a^  with  respect  to  a. 

We  use  the  a's  heavily  in  the  following  lemmas. 
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Lemma  III.1:  Let  v  c  DV  and  a  e  C.  Then  (x  -  a)v  =  0  if  and  only  if  v 
is  a  scalar  multiple  of  a. 

Proof :  If  a  =  0,  the  theorem  is  obvious  and  we  omit  the  proof. 
Likewise,  If  v  is  ca  we  omit  the  calculation  that  (x  -  a)  ca  =  0.  However, 
suppose  (x  -  a)v  =  0.  Then  the  equation  says  that  v^+-|  =  TTv-j .  Thus 

—  —2  —  — 1  —  -2 

v.j  =  avg  ,  v2  =  a  vfl,  etc.  Furthermore,  v  ^  =  (a)  Vq,  v_2  =(a)  vQ,  ... 

That  is,  v  =  vg  a. 


Lemma  III. 2:  Let  a  c  C  be  non-zero.  Then  (x  -  a)a(J)  =  ja(J-1 ) . 
Proof:  Let  v  =  (x  -  a)a^.  Then 


Vk  =  ( k+1 ) k  ...  ((k+l)-(j+l))  (a)k+1"J’  -  a  k  (k-1)  ...  (k-j+l^M 


=  k(k-l)  ...  (k-j+2)  (I)k_J+1  ((k+l)-(k-j+l)) 

=  j  k(k-l)  ...  (k-j+2)  (i)k“J+1 
and  this  Is  the  k-th  entry  of  j  a^~^) 

We  will  use  these  to  prove: 

Theorem  I I I. 3:  Let  a  c  C  be  non-zero  and  let  v  e  DV.  (x  -  a)kv  =  0 
if  and  only  If  v  is  a  linear  combination  of  a,  a>'),  ....  a'k_1). 

Proof:  We  use  induction  on  k.  We  have  already  proved  the  theorem  true 
for  k  =  1 .  Suppose  we  know  it  Is  true  for  k  and  we  wish  to  prove  it  for  k  +  1. 

If  (x  -  a)k+1v  =  0,  then  (x  -  a)kv  is  in  the  kernel  of  x  -  a.  Thus 
(x  -  a)kv  =  ca  by  theorem  I I I . 1 .  By  theorem  III. 2,  we  see  that  (x  -  a)k 
(c/k!)  a(k)  =  ca.  Thus  v  =  (c/k!)  a,(k)  +  w  where  (x  -  a)kw  =  0.  But, 
using  the  Induction  hypothesis,  w  Is  a  linear  combination  of  a.  .... 
a(k~T).  That;  tells  us  that  v  Is  a  linear  combination  of  a,  a^'s  .... 
a(k)  and  we  are  done. 

We  have  now  successfully  determined  the  kernel  of  every  polynomial 
operator  of  the  form  (x  -  a)k.  We  would  now  like  to  determine  the  kernel  of 
every  polynomial  operator.  Since  C  is  algebraically  closed,  we  know  that 
every  polynomial  In  C[x]  factors  uniquely  as  a  product  of  a  constant  times  a 
set  of  polynomials  of  the  form  (x  -  a)k.  We  should  thus  be  able  to  apply 
theorem  III. 3  to  find  the  kernel  of  all  operators.  We  will,  however,  need  one 
more  lemma. 

Lemma  II I. 4:  Let  f(x)  =  g(x)h(x)  e  C[x]  where  g(x)  and  h(x)  have  no 
roots  in  common.  Let  v  e  DV.  Then  f(x)v  =  0  if  and  only  if  there  are  v-j 
and  V2  In  DV  with  g(x) v-j  =  0,  h(x)V2  =  0  and  v  =  v-j  +  V2» 

Proof:  The  "if"  part  of  the  theorem  is  obvious.  For  the  only  if  part, 
we  suppose  f(x)v  =  0.  We  now  recall  that  the  Euclidean  algorithm  tells  us 
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that  if  g(x)  and  h ( x )  are  relatively  prime,  there  are  polynomials  r(x),  s(x)  e 
C[x]  with  the  property  that  r(x)g(x)  +  s(x)h(x)  =  1.  Thus  r(x)g(x)v  + 
s(x)h(x)v  =  v.  Let  V2  =  r(x)g(x)v.  Then  h(x)V2  =  h(x) r(x)g(x) v  = 
r(x)g(x)h(x)v  =  r(x)f(x)v  =  0.  Similarly,  if  we  let  v-|  =  s(x)h(x)v,  then 
g(x)vT  =  0. 


Theorem  III . 5:  Suppose  p(x)  c  C[x]  where  p(x)  =  aQ(x-a.|)  - 

k 

(x-an)  n.  If  v  c  DV,  then  p(x)v  =  0  if  and  only  if  y  is  a  linear  combination 

(krl)  (kn-l) 

°f  -1  *  '  •  • '  -i  . -n  ’  *  *  *  *  -n 

Proof :  We  use  induction  on  n.  The  case  n=l  has  been  covered  in  theorem 
1 1 1. 3.  So  assume  we  know  the  theorem  is  true  for  n  and  we  will 

k  k^  k^+1 

prove  it  for  n+1 .  In  that  case,  p(x)  =  aQ(x-a,)  . ..(x-an)  n(x-an+1)  n  . 

From  theorem  III. 4  we  know  that  p(x)v  =  0  if  and  only  if  v  =  v-|  +  v2 
k  k  k 

where  (x-a  , )  n+  v,  =  0  and  (x-a^ ) . ! . (x-an)  n  v?  =  0.  But  the  induction 

(k,-l) 

step  tells  us  that-  v2  is  a  linear  combination  of  a^ ,  ...»  a -j  . 


(k  -1 ) 

a  ,  ...,  a  and  theorem  III. 3  tells  us  that  v.  is  a  linear  combination 

— n  —  n  l 

(kn+i_1 } 

of  a^  ,  ....  an+1  and  we  are  done. 

Let  us  now  proceed  from  doubly  infinite  vectors  to  doubly  infinite 
matrices  where  we  shall  make  use  of  the  preceding  theorems.  Before  we  go  too 
far,  we  should  make  a  few  observations  concerning  polynomials  in  C[x] 
considered  as  operators  on  DM.  In  the  first  place,  we  note  that  if  f(x)  e 
C[x],  then  f(x)  operates  on  elements  of  DM  in  a  row-wise  manner.  That  is, 
f ( x )  operates  on  one  row  at  a  time. 

Furthermore,  if  a  e  C,  then  we  may  "stack"  scalar  multiples  of  a  to  form 
a  doubly  infinite  matrix.  We  shall  denote  such  a  stacked  matrix  by 
c  .  a  where  we  think  of  c  as  a  doubly  infinite  column  vector  with  elements 
from  C.  (Intuitively  speaking,  c  is  the  column  vector  formed  by  picking  out 
the  elements  in  column  zero  of  c  .  a.)  Similar  remarks  apply  to  the  row 
vectors  a(J).  With  these  ideas  in  mind,  we  have  the  following  theorem  whose 
proof  is  obvious. 

Theorem  III. 6:  Let  p(x)  e  C[x]  and  let  T  e  DM  be  such  that 
p(x)T  =  0.  .Then  T  is  a  linear  combination  of  doubly  infinite  matrices  of  the 
form  c  •  a(j)  where  (x-a)j+l  is  a  divisor  of  p(x). 
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We  shall  also  need  the  following  technical  lemmas. 

Lemma  1 1 1. 7:  Let  T  c  DM  and  a  c  C.  Then  there  is  a  U  e  DM  with  the 
property  that  (x-a)U  =  T. 

Proof:  If  a=0,  we  need  only  shift  the  columns  of  T  one  unit  to  the  right 

to  obtain  U.  If  a  is  not  zero,  then  we  only  need  to  consider  the  action  of 

x-a  on  a  typical  row.  For  row  1,  the  equation  (x-a)U  =  T  states  that  u.  . 

i  ,j  +  i. 

-  a  u-j  j  =  t-jj.  If  j=0,  set  u-^o  =  0*  Now  solve  this  set  of  equations 
recursively. 

Corollary  III. 8:  Let  T  c  DM  and  let  p(x)  e  C[x] .  Then  there  is  a 
U  e  DM  with  p(x)U  =  T. 

Proof:  All  we  need  to  do  Is  factor  p(x)  into  factors  of  the  form  (x-a) 
and  use  the  above  theorem  several  times. 

The  next  lemma  is  somewhat  different  in  tone  from  the  preceding.  With  a 
little  thought,  it  is  easy  to  see  that  it  concerns  solutions  of  Toeplitz 
systems  of  equations. 


Lemma  1 1 1. 9:  Let  c-| ,  ....  cneC  be  not  all  zero.  Let  d  be  any  doubly 
infinite  column  vector  over  C.  Then  there  is  another  doubly  infinite  column 
vector  w  over  C  with  the  property 


clwk  +  c2wk+l 


+  cnVn-l  '  dk 


for  each  k  e  Z. 

Proof :  We  may  assume  without  loss  of  generality  that  c-|  and  cn  are 
not  zero.  First,  pick  any  numbers  wq,  ....  wn_-|  which  solve  the  above 
equation  for  i=0.  Now  let  k=l .  We  now  have  an  equation  which  involves  c-| , 
...,  cn,  w-| ,  ...,  wn_i  and  d-|  which  are  quantities  we  already  know. 

The  only  other  variable  is  Wp  for  which  we  may  solve.  In  this  way,  we  can 
solve  for  any  w^  simply  by  sliding  the  quantities  c-| ,  ....  cn  along  the 
vector  w  and  picking  up  one  more  unknown  at  a  time. 

We  shall  use  lemma  III. 9  in  the  proof  of  lemma  III. 10.  In  algebraic 
terms,  lemma  III. 10  states  that  certain  matrices  are  divisible  by  certain 
polynomials. 

Theorem  II I. 10:  Let  a  c  C  be  non-zero.  Let  T  e  DM  be  such  that 
k 

(x-a)  T  =  0.  Let  0(x,y)  e  C[x,yj  be  such  that  x-a  does  not  divide  d(x,y). 

Then  there  is  a  U  e  DM  such  that  T  =  0(x,y)U  and  (x-a)^U  =  0. 

Proof:  We  shall  write  0(x,y)  =  fg(x)  +  f-j(x.y)  +  •••  +  fgC’Oy01- 
Then,  the  non-divisibility  condition  states  that  not  all  of  fg(a),  .... 
f n( a)  are  zero. 
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We  now  proceed  by  induction.  Suppose  k=l .  Then  (x-a)T=0.  Thus, 

T  =  c  .  a  for  some  column  vector  c.  Consider  the  equation  0(x,y)d  .  a  = 
c  .  a  where  the  vector  d  consists  of  unknowns.  Because  of  the  way  0(x,y) 
operates  on  doubly  infinite  matrices,  we  see  that  the  (m,n)  entry  of 
0(x,y)d  •  a  is 


n+1 


a  r\  a  .  a 
0,0  m 


+  a0,l  dma 


_n+q 


+  an  _  d_a 
J,q  m 


_  _n 

+  al,0  dm+la 


_n+q 

a 


a  a  d  a 
p , 0  m+p 


+  a 


P.l 


_n+q 

a 


=  f«(a)  d 


in 


f-j  (a) 


m+1 


a" 


+  ya> 


m+p 


a" 


This  entry  should  be  equal  to  c^a11.  Thus,  we  only  need  to  solve  the  system 
of  equations 


fn<a>  dm  + 


f-,  (a) 


m+l 


fp(a) 


m+p 


=  c 


m 


for  each  m.  But  we  know  we  can  do  this  using  the  fact  that  not  all  of  the 
fo(a) . fp(a)  are  zero  and  applying  the  previous  lemma. 

Now  we  have  a  matrix  d  .  a  with  0(x,y)  d  .  a  =  c  .  a  =  T  and  (x-a)  d  OL 
=  0.  Suppose  we  know  the  theorem  is  true  for  k  and  we  wish  to  prove  it 
for  k+1 .  Consider  (x-a)^+^T  =  0.  Then  (x-a)k( (x-a)T)  =  0.  By 
induction,  (x-a)T  =  0(x,y)U  and  (x-a)^U  =  0.  Pick  a  V  with  (x-a)V  =  U. 
(Lemma  III. 7  says  this  can  be  done).  Then  (x-a)(T-0(x,y)V)  =  0.  Thus, 

T  -  0(x,y)V  =  0(x,y)W  where  (x-a)W  =  0.  But  then  T  =  0(x,y)(V+W)  and 
obviously,  (x-a)^+' ( V+W)  =  0. 

Corollary  1 1 1. 11  :  Let  T  e  DM  and  suppose  that  p(x)  e  C[x]  with 
p(x)T  =  0.  Let  0(x,y)  e  C[x,y]  with  the  property  that  if  a  is  a  root  of 
p(x),  then  (x-a)  does  not  divide  0(x,y).  Then  there  is  a  U  c  DM  such  that 
T  =  0(x,y)U  and  p(x)U  =  0. 

Proof;  Apply  theorem  III. 10  once  for  each  root  of  p(x)  to  get  the 
desired  U.  We  can  tie  the  above  together  to  get  the  following. 
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Theorem  III.12;  Let  p(x)  c  C[x]  and  0(x,y)  c  C[x,y].  Suppose  that  if 
a  is  a  root  of  p(x),  then  (x-a)  does  not  divide  0(x,y).  If  T  e  DM  has  the 
property  that  p(x)0(x,y)T  =  0,  then  T  =  U+V  where  U,  V  e  DM  and  0(x,y)V  =  0 
and  p(x)U  =  0. 

Proof;  From  Corollary  III. 11,  we  see  that  there  is  a  U  with  0(x,y)T  = 
0(x,y)U  and  p(x)U  =  0.  If  we  let  V  =  T-U,  we  have  the  desired  decomposition 

We  can  now  mount  an  assault  on  our  main  theorem:  Suppose  we  have 
T  c  DM  and  0(x,y)  and  (x,y)  In  C[x,y].  Suppose,  further,  that  0(x,y)  and 
i|>(x,y)  are  relatively  prime  and  0(x,y)i|>(x,y)T  =  0.  We  would  like  to 
decompose  T  as  U+V  where  0(x,y)U  =  0  and  *p(x,y)V  =  O'.  The  above  theorems 
show  that,  if  one  of  the  polynomials  Involves  only  x,  the  decomposition 
occurs.  Therefore,  from  now  on  we  may  assume  that  both  0(x,y)  andij»(x,y) 
involve  y  non-trivially  and  that  they  have  no  factors  which  are  polynomials 
in  x  alone. 

We  shall  need  some  facts  concerning  polynomials  in  C[x,y]  and  those 
same  polynomials  in  C( x) [y ] .  This  second  ring  consists  of  polynomials  In  y 
whose  coefficients  are  rational  functions  in  x.  If  follows  from  Gauss's 
lemma  that  irreducible  polynomials  in  C[x,y]  remain  Irreducible  when 
considered  as  elements  of  C(x)[y].  Thus,  if  0(x,y)  andi|j(x,y)  are 
relatively  prime  in  C[x,y],  they  remain  relatively  prime  in  C(x)[y].  In 
C(x)[y]  we  may  find  elements  ro(x»y)  and  5o(x*y)  with  the  property  that 
Yo(x»y)0(x.y)  +  *o(x»y)*Kx*y)  =  1 •  If  we  clear  denominators,  we  now 
have  the  equation  yi (x,y)0(x,y)  +  5-| (x,y)4<(x,y)  =  q(x)  In  C[x,y].  These 
facts  will  be  useful  in  the  following  proof. 

Theorem  III.  13:  Let  T  c  DM  and  suppose  0(x,y)iJj(x,y)T  =  0  where  0(x,y), 
Y'(x.y)  c  C[x,y]  are  relatively  prime.  Suppose  further  that  neither  0(x,y) 
nor  «Kx,y)  has  a  nontrivial  factor  which  involves  only  x.  Then  T  =  U+V 
where  0(x,y)U  =  0  and  1p(x,y)V  =  0. 

Proof :  We  make  use  of  the  equation  yi (x.y)d[x,y)  +  6-| (x,y)iKx,y) 

=  q(x).  From  Corollary  III. 8,  we  know  there  is  W  e  DM  with  q(x)W  =  T.  Then 
yi (x,y)0(x,y)W  +  5i(x,y)4i(x,y)W  =  T.  From  the  definition  of  W,  we  see 
that  q(x)0(x,y)i|j(x,y)W  =  0.  If  we  apply  Gauss's  lemma,  we  see  that,  if  a  is 
a  root  of  q(x),  then  x-a  does  not  divide  0(x,y)kKx,y) •  Thus,  we  may  find 
W]  and  W2  with  the  properties  that  W  =  W-|  +  W2,  d(x,y)i|>(x,y)Wi  =  0 
and  q ( x ) W2  =  0.  If  we  let  U  =  (x,y)4j(x,y)  W-|  and  V  =  Y]  (x,y)0(x,y)  \V, 

we  see  that  0(x,y)U  =  0  andvj>(x,y)V  =  0.  Furthermore,  U+V  =  S-|(x,y) 
^(x.ylW-)  +  Yi(x,y)0(x,y)W-| 


=  q(x)W-| 


-  q(x)(w-]+w2) 


=  q(x)W 
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The  above  theorem  allows  us  to  make  use  of  unique  factorization  in 
C[x,y]  to  get  a  more  general  decomposition. 

Theorem  1 11.14:  Let^(x,y)  e  C[x,y]  and  T  e  DM.  Suppose  Mx,y)  has 

el  ek 

the  unique  factorization ^ (x,y)  =  d^x.y)  ...  0k(x,y)  in  C[x,y]  and 
furthermore,  suppose^  (x,y)T  =  0.  Then  there  are  . Uk  e  DM 

ej 

such  that  T  =  U1  +  ...  +  Uk  and  0j(x,y)  =  0  for  j  =  1,  ...,  k 

Proof:  We  need  only  use  a  simple  induction  on  the  number  of  factors. 
Obviously,  the  converses  of  theorem  III. 13  and  theorem  III. 14  holds  also.  In 
closing  this  section,  we  should  point  out  that  theorem  III. 14  holds  also  for 
masked  matrices  of  finite  size.  However,  we  need  to  modify  the  definition  of 
a  mask  so  that  it  makes  sense  for  m  by  n  matrices. 


Definition  I I I . 1 :  Let  T  =  [tr  s],  r=l ,  ...,  m,  s=l ,  ...,  n  be  an  m  by 

n  matrix  over  C  and  let  M  =  [aj<fj],,k=0 . p,  j=0 . .  be  a  p+1  by  q+1 

matrix  over  C.  We  say  M  is  a  matrix  mask  for  T  if  and  only  if 


q  P 

y  y  vjw.s+j  =  ° 

j-0  k=0 

for  all  r  and  s  with  r+p  <  m  and  s+q  <  n. 

The  point  Is  that  we  must  ensure  that  the  mask  M  does  not  go  outside  the 

bounds  of  the  matrix  T. 

Now  suppose  we  are  given  a  matrix  T  of  finite  size  which  satisfies  a  mask 
M.  The  theorem  below  tells  us  that  we  may  enlarge  T  to  another  matrix  T1 
such  that  T1  also  satisfies  M. 

Theorem  I I 1.15:  Let  T  be  an  m  by  n  matrix  over  C  which  satisfies  M  = 

[a-j  j]  which  Is  a  p  by  q  matrix  mask.  Then  there  Is  a  column  m  vector  v 
sucfi  that  T1 ,  the  m  by  n+1  matrix  obtained  by  adjoining  v  to  the  right  of  T, 
also  satisfies  M. 

Proof:  We  picture  the  construction  of  v  geometrically  by  adjoining  a 
column  of  unknowns  to  the  right  of  T.  Now  we  think  of  placing  the  first  q-1 
columns  of  M  on  top  of  the  last  q-1  columns  of  T  so  that  the  first  row  of  M 
lies  over  the  first  row  of  T  and  the  last  column  of  M  lies  over  the  column  of 

unknowns.  Suppose  that  k  Is  the  index  of  the  highest  numbered  row  such  that 

a^q  is  not  zero.  Assign  any  values  to  the  first  k  unknowns  so  that  M 
becomes  a  mask  for  the  submatrix  over  which  it  currently  lies.  Next,  move  M 
down  one  row  and  assign  a  value  to  the  unknown  numbered  k+1  so  that  H  Is  a 
gain  a  mask  for  the  submatrix  over  which  it  lies.  Continue  in  this  manner 
assigning  values  to  all  of  the  unknowns.  Then  v  is  the  vector  obtained  from 
those  unknowns. 
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Corollary  III. 16:  Let  T  be  an  m  by  n  matrix  which  satisfies  a  p  by  q 
matrix  mask  M.  Then  there  is  a  doubly  infinite  matrix  T1  such  that  T  is  a 
submatrix  of  T1  and  T1  satisfies  M. 


.  Proof:  We  use  theorem  III. 15  to  add  rows  and  columns  to  T  in  order  to 
get  larger  matrices.  A  simple  Induction  then  allows  us  to  conclude  that  V 
exists. 

We  conclude  this  section  with  our  main  statement  about  finite  matrices. 

Theorem  III. 17:  Let  T  be  an  m  by  n  matrix  over  C  and  let^(x.y)  e  C[x,y]. 

el  ek 

Suppose  ^(x,y)  has  the  unique  factorization  d^x.y)  ...  0k(x,y)  in  C[x,y]. 

Then  T  satisfies  the  polynomial  mask^(x,y)  if  and  only  if  there  exist  m  by  n 

e . 

matrices  U-j . Uk  such  that  satisfies  the  polynomial  mask  0j(x,y)  J  and 

T  =  Ui  +  ...  +  U|< . 

Proof:  Suppose  T  satlsf  ies^  (x,y) .  Then  there  is  us  a  doubly  infinite 
matrix  T1  which  contains  T  as  a  submatrix  and  which  satisf ies ^ (x,y) . 

Decompose  T1  as  a  sum  of  doubly  infinite  matrices  U^j . Uk  such  that  ul 

ei  1 

satisfies  0j(x,y)  J.  Then  let  U  be  the  submatrix  of  U  which  corresponds 

1  e  • 

to  the  submatrix  T  of  T:  Obviously  U.  satisfies  0.(x,y).J  and  T  =  U,  +  ...  +  U.. 

J  J  J  1  J 

The  converse  is  similar. 
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